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(57) Abstract 

The invention provides a recombinant virus vector for use as an immunotherapeutic or vaccine. The recombinant virus vec- 
i tor comprises at least one pair of nucleotide sequences heterologous to the virus and which have sufncient sequence homology 
: that recombination between them might be expected. Tlie pair of nucleotide sequences are arranged m the virus vector such that 
I rhpv :\TP invertPd with r^^nQct tn earh other The vin(s vector is able to infea a mammalian host cell and express as polypeptide 
: the'heterologous nucleotide sequences in the host cell. For infection thought to be caused by HPV infection, the pair of nucleotide 
i sequences encode part or all of human papillomavirus (HPV) wild-type proteins or mutant protems immunologically cross-reac- 
■ tive therewith. For an immunotherapeutic or vaccine against cervical cancer, the recombinant virus vector encodes part or all of 
i the HPV wild-type proteins HPVI6E7 and HPVISE:- or mutant proteins immunologically cross-reactive therewith. 
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RECOMBINANT VIRUS VECTORS ENCODING HUMAN PAPILLOMAVIRUS PROTEINS 
This invention relates to recombinant virus vectors. 
In particular, it relates to recombinant virus vectors 
designed to overcome the problem of recombination 
5 between homologous nucleotide sequences. It also 
relates to recombinant virus vectors encoding human 
papillomavirus proteins; to immunotherapeutics and 
vaccines for conditions associated with HPV infection; 
to the production of a virus (e.g. vaccinia virus) 

10 engineered to express antigens encoded by human 
papillomavirus types 16 and 18 and to 
immunotherapeutics and vaccines for cervical cancer. 

In recent years, strong evidence has been adduced 
for a link between cervical carcinoma and infection 

15 with certain types of human papillomavirus (HPV), 
particularly with types 16, 18, 31, 33 and 35 (Gissman 
et al.. Cancer Cells 5,275, 1987), This is based on 
hybridisation studies which have indicated that more 
than 85-90% of biopsies from cervical tumours can be 

20 shown to contain papillomavirus DNA* HPV16 DNA is 
most commonly found (in about 60% of tumours) with 
HPV18 the next most frequent (about 20%) and the other 
types accounting for a further 5-10%. In many 
instances, tumour cells from the biopsies do not 

25 however , contain the complete genome , but rather a 
deleted form. The exrent and location of the deleted 
information within the virus genome is variable, but 
a general feature is the retention of the part of the 
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genome encoding "the E7 protein ( Schwarz et al . , Nature 
314, 111, 1985). In addition, the adjacent 

E6-encoding region is usually present. The ubiquitous 
presence of the E7-encoding region in tumour cells 
5 suggests that the protein product of this gene might 
play a role in the induction or maintenance of the 
transformed phenotype. Indeed in most cell lines 
established from tumour biopsies, expression of the E7 
gene can be detected (Smotkin & Wett stein, PNAS, 83, 

10 4680, 1986). Furthermore, it has been shovm that the 
E7 gene product can bind to the retinoblastoma (Rb) 
gene product, a recognised " ant i -oncogene" in normal 
human cells (Hunger et al., EMBO J. 8,4099, 1989). 
This strengthens the belief that E7 is directly 

15 involved in cell transformation. 

The presence and expression of the E7 and E6 
genes in tumour cells derived from cervical carcinoma 
biopsies, suggests the possibility that these proteins 
could be potential targets for the immunological 

20 recognition of the tumour cells. It is well known 
that viral proteins produced inside mammalian cells 
can be processed through a host cell pathway to short 
peptides, which then form a complex with host Major 
Histocompatibility Complex ( MHC ) Class 1 molecules and 

25 are transported to the cell surface. These complexes 
may then present a target for recognition by the host 
immune system. Interaction of the complex with the 
receptor molecule on the surface of cytotoxic T cells 
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(the T cell receptor) can then lead to activation of 
the T cells to proliferate or to destroy the 
recognised cell. It is possible, therefore, that the 
presence in the body of a population of cytotoxic T 
5 lymphocytes (CTLs) which are capable of recognising 
cells expressing the HPV E6 and/or E7 proteins could 
afford protection against the development and 
proliferation of cei^ical tumours. Indeed it has been 
reported that normally oncogenic mouse cells 
10 engineered to express the HPV E7 protein are unable to 
form tumours in mice which have been previously 
immunised with non-tumorigenic E7-expressing cells, 
and that this rejection is mediated by CD8 + 
lymphocytes (CTLs) (Chen et al., PNAS 88, 110, 1991). 
1 5 Further , the generation of an active population of 
such cells subsequent to tumour initiation could 
result in regression of the tumour. 

There are numerous reports on the construction of 
recombinant viruses eg vaccinia viruses containing, 
20 and expressing foreign genes (Mackett & Smith, J. gen. 
Virol. 67,2067, 1986), and several reports of the use 
of these recombinant viruses to generate effective 
immune responses against the expressed foreign 
antigens. A particular advantage of this route for 
25 delivery of antigens for vaccination is that it may 
lead to the development of cellular as well as humoral 
immunity. This is because the foreign proteins will be 
produced inside cells of the infected individual in a 



wo 92/16636 




PCr/GB92/00424 



4 

manner similar to that which occurs during natural 
infection. This means that they should be processed 
through the correct pathway to allow generation of a 
CTL response. In several cases, it has been 
5 demonstrated directly that immunisation with the 
recombinant virus is capable of producing a cellular 
immune response in the form of foreign 
antigen-specific CTLs (Moss & Flexner, Ann. Rev. 
Immunol., 5,305, 1987). Furthermore, vaccination of 

10 animals with recombinant vaccinia viruses expressing 
certain tumour-specific antigens, such as the human 
melanoma-associated antigen P97 ( Estin et al . , PNAS, 
85, 1052, 1988), the bovine papillomavirus E7 protein 
(Meneguzzi et al.. Vaccine, 8, 199, 1990) and the 

15 human breast cancer-associated antigen ETA (Hareuveni 
et al., PNAS, 87, 9498, 1990) has been demonstrated to 
result in the induction of immunity against tumour 
initiation and progression. 

The present applicants have recognised the 

20 desirability of producing a recombinant virus vector 
which is useful as an imraunotherapeutic or vaccine for 
conditions caused by HPV infection, for example for 
cervical cancer. With respect to cervical cancer, the 
art at the time of the applicants making the present 

25 invention recognised the E7 gene as having the 
potential to immortalise cells. Therefore, it would 
be felt inappropriate to incorporate the E7 gene in an 
immunotherapeutic . The applicants however , have 
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recognized the surprising usefulness of including the 



recognized that the beneficial effects to be gained by 
treatment with an immuno therapeutic comprising the E7 
5 gene are likely to outweigh by far any risk associated 
with the oncogenic activity of the E7 gene. Thus, an 
aspect of the applicants invention involves the use of 
a recombinant virus vector which expresses an E7 gene, 
as an immunotherapeutic or vaccine. Furthermore, the 

10 applicants provide embodiments of their invention in 
which these risks are reduced still further by 
specific alteration of the gene sequences in order to 
reduce the oncogenic potential of the E7 gene without 
compromising its ability to stimulate an appropriate 

1 5 immune response . 

The present applicants have also recognized that 
where a number of HPV proteins which may be encoded by 
different HPV strains are implicated as being 
associated with a particular HPV-associated condition 

20 (for example, cervical carcinoma, HPV16 nd HPV18; 
genital warts, condyloma acuminata, respiratory 
papillomatosis, HPV6 and HPVll; squamous cell 
carcinoma in immunosuppressed individuals, HPV5 and 
HPV8 ) , rather than produce a plurality of recombinant 

25 viruses engineered separately to express each of the 
implicated proteins, it would be advantageous to 
produce a single virus recombinant which is able to 
express part or all of the sequences of more than one 



E7 gene in an immunotherapeutic. 



They have also 
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of the proteins. Thus, with respect to cervical 
cancer, rather than produce four recombinant viruses 
engineered separately to express each of the potential 
targets for immunological recognition of cervical 
5 tumour cells i.e. the HPV16 E6, HPV16 E7, HPV18 E6 and 
HPV18 E7 proteins, it would be especially advantageous 
to produce a single virus recombinant which is able to 
express the part or all of the sequences of more than 
one of the proteins, preferably at least two of the 

10 proteins and most preferably all four proteins. That 
the present applicants are able to achieve this is 
particularly surprising. This is because, the coding 
sequences for many HPV proteins are highly homologous 
to other equivalent HPV proteins ( for example from 

15 other virus strains). Thus, the HPV16 E6 and HPV18 E6 
proteins show overall homology of 62% and comprise 
regions of very high homology . The same is true for 
HPV16 E7 and HPV18 E7 which show overall homology of 
57%, with particular regions of very high homology - 

20 This means that one would expect recombination to 
create problems such as loss of gene sequences. The 
applicants have, however, devised a novel strategy 
designed to minimise the likelihood of such 
recombination events and to circumvent the deleterious 

25 effect of those events should they indeed arise. 
Thus, surprisingly, the invention provides recombinant 
virus vectors which comprise at least one pair of 
nucleotide sequences which have sufficient sequence 
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homology that recombination between them might be 



sequences may encode part or all of human 
papillomavirus (HPV) wild- type proteins or mutant 
5 proteins immunologically cross-reactive therewith. In 
particular, the invention provides a recombinant 
vector which can maintain stably, and express, part or 
all of four of the desired gene sequences from HPV16 
and HPV18. 

10 Thus^ the present invention provides a 

recombinant virus vector for use as an 
immunotherapeutic or vaccine which comprises at least 
one pair of nucleotide sequences heterologous to said 
virus and which have sufficient sequence homology that 

15 recombination between them might be expected wherein 
said pair of nucleotide sequences are arranged in said 
virus vector such that they are inverted with respect 
to each other to reduce the likelihood of 
recombination events leading to loss of part or all of 

20 said sequence and said virus vector is able to infect 
a mammalian host cell and express as polypeptide the 
heterologous nucleotide sequences in said host cell. 
The at least one pair of nucleotide sequences may 
encode part or all of human papillomavirus (HPV) wild- 

25 type proteins or mutant proteins immunologically 
cross-reactive therewith. The pair of nucleotide 
sequences may encode parr or all of the protein E7 
from both HPV16 and HPV18 or functional equivalents 



expected . 



The at least one pair of nucleotide 
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thereof. The pair of nucleotide sequences may encode 
part or all of the proteins E6 from both HPV16 and 
HPV18 or functional equivalents thereof. 

The recombinant virus vector may comprise a 
5 further pair of nucleotide sequences heterologous to 
said virus and which ( i ) have sufficient sequence 
homology that recombination between them might be 
expected wherein said further pair of nucleotide 
sequences are arranged in said virus vector such that 
10 they are inverted with respect to each other and said 
virus vector is able to infect a mammalian host cell 
and express as polypeptide the further pair of 
heterologous nucleotide sequences in said host cell. 

The further pair of nucleotide sequences may 
15 encode part or all of HPV wild- type proteins or mutant 
proteins immunologically cross-reactive therewith. 

For example, the present invention also provides 
a recombinant virus vector which in addition to the E7 
coding sequences, also comprises and is adapted to 
20 express genetic sequences encoding part or all of the 
protein E6 from both HPV16 and HPVIB or functional 
equivalents thereof. The genetic sequences may 
comprise sequences encoding HPV16 E6/E7 and HPVIB 
E6/E7 as shown in figures 1(a) and 1(b) 
25 respectively. 

The genetic sequences may encode an antigenic 
moiety of the said proteins. 

Either or both of the nucleotide sequences in a 
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pair of nucleotide sequences may be altered to make 
them less homologous than an equivalent pair of 
nucleotide sequences encoding wild-type HPV proteins. 
The alteration in nucleotide sequence may be in an 
5 area of high sequence homology. Preferably, the 
alteration in nucleotide sequence will not result in 
an alteration of the encoded amino acid sequence. 

Two or more nucleotide sequences each encoding 
separate proteins may be fused together to form a 
10 single open reading frame. Thus the genetic sequences 
encoding part or all of the proteins E6 and E7 from 
HPV16, may be fused together to form a single open 
reading frame. The genetic sequences encoding part or 
all of the proteins E6 and E7 from HPV18, may be fused 
15 together to form a single open reading frame. The 
genetic sequences encoding part or all of the proteins 
E6 and E7 from both HPV16 and HPV18, may be fused 
together to form a single open reading fraime. Thus, 
the recombinant virus vector may have the pairs of 
20 nucleotide sequences arranged according to any one of 
the options shown in Figure 26. Where the recombinant 
virus vector comprises an open reading frame having a 
fused genetic sequence encoding part or all of the 
proteins E6 and E7 from HPV16, and a separate open 
25 reading frame having fused genetic sequences encoding 
part or all of the proteins E6 and E7 from HPV18, the 
two open reading frames may be inverted with respect 
to one another. For example, the two open reading 
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frames may be arranged in the recombinant: virus vector 
adjacent to each other. The inversion may be such 
that the E6 coding sequences of HPV16 and HPV18 are 
both located between the E7 coding sequences of HPV16 
5 and HPV18. Alternatively, the inversion could be such 
that the E7 coding sequences of HPV16 and HPV18 are 
both located between the E6 coding sequences of HPV16 
and HPV18. In particular the two open reading frames, 
each with its respective promoter, may be arranged 
10 next to each other in the recombinant vector. In this 
case the promoters may be located between the genes, 
which are transcribed outwardly, or the promoters may 
be located outside the genes, which are transcribed 
inwardly. 

15 Similarly, the genetic sequences encoding part or 

all of the E7 protein from HPV16 and the E7 protein 
from HPV18 may be fused together to form a single open 
reading frame. The genetic sequences encoding part or 
all of the E6 protein from HPV16 and the E6 protein 

20 from HPV18 may be fused together to form a single open 
reading frame. This leads to another range of 
arrangements similar to those shown in Figure 26. The 
fusions may be via a single codon encoding a 
relatively small neutral amino acid e.g. glycine. 

25 Thus the present invention also provides a 

recombinant virus vector which comprises a first open 
reading frame having a fused genetic sequence encoding 
part or all of the wild-type proteins E6 and E7 from 
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HPV16; and a separate second open reading frame having 
a fused gene-tic sequence encoding part or all of the 
wild-type proteins E6 and E7 from HPV18; wherein the 
first and second open reading frames may be inverted 
5 with respect to one another whereby either: i) the E6 
coding sequences of HPV16 and HPV18 are both located 
between the E7 coding sequences of HPV16 and HPV18; or 
ii) the E7 coding sequences of HPV16 and HPVIB are 
both located between the E6 coding sequences of HPV16 
10 and HPV18; and wherein any of said wild-type proteins 
may be replaced by a mutant protein immunologically 
cross-reactive therewith. 

Each of the first and second open reading frames 
may have a corresponding promoter and the two open 
15 reading frames each with its promoter, are arranged 
next to each other in the virus. 

The present invention also provides a recombinant 
virus vector wherein either: i) the promoters are 
located between the first and second reading frames 
20 whereby the open reading frames are transcribed 
outwardly; or ii ) the promoters are located outside 
the first and second open reading frames whereby the 
open reading frames are transcribed inwardly. 

The present invention also provides a recombinant 
25 virus vector which comprises a first open reading 
frame having a fused genetic sequence encoding part or 
all of the wild-type proteins E6 and E7 from HPV16; 
and a separate second open reading frame having a 
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fused genetic sequence encoding part: or all of the 
wild-trype proteins E6 and E7 from HPV18; wherein the 
E6 coding sequences of HPV16 and HPV18 are both 
located between the E7 coding sequences of HPV16 and 
5 HPV18; and each open reading frame has a corresponding 
promoter, the promoters being located between the 
first and second open reading frames whereby the open 
reading frames are transcribed outwardly; and wherein 
any of said wild-type proteins may be replaced by a 
10 mutant protein immunologically cross-reactive 
therewith . 

The wild-type proteins HPV16E7 and HPV18E7 may be 
replaced with mutant proteins which are substantially 
homologous to said wild-type proteins and in which the 
15 residues cys 24 and glu 26 of wild-type protein 
HPV16E7 and the residues cys 27 and glu 29 of wild- 
type protein HPV18E7 are replaced with glycine 
residues . 

The recombinant virus vector may be derivable 

20 from vaccinia virus. 

The applicants have also recognized that for 
effective function as an immunotherapeutic, it is 
desirable for the recombinant virus to retain its 
ability to replicate and thereby generate an active 

25 infection in order that a cellular immune response may 
be mounted against the virus-encoded proteins. Thus, 
the applicants propose thar the foreign gene sequences 
should be inserted into the vector virus at sites, the 
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disruption of which by -the insertrion of the 
heterologous gene sequences will not substantially 
interfere with, and therefore have a substantially 
adverse affect on any viral functions which relate to 
5 the replicative ability of the virus in the infected 
host animal. The applicants have named these sites 
'neutral sites* (although the term 'neutral' should 
not be interpreted strictly as it is acknowledged that 
the disruption of these sites may have a small, but 
10 relatively speaking inconsequential adverse affect on 
replicative ability). 

DNA sequences which affect virus replication can 
fall into several categories: 
i) protein coding sequences; 

15 ii ) elements involved in control of gene expression; 
and iii) elements involved in virus DNA replication 
A non-essential and neutral insertion site must 
therefore avoid such regions, and, such sites have 
been identified on 
20 the basis of nucleotide sequencing studies. Thus the 
genetic sequences may be inserted into neutral sites 
within the virus genome. One or more genetic 

sequences may be inserted into the same neutral site. 

Neutral sites can be easily tested for according 
25 to techniques well known in the art. For example, a 
site may be selected, interrupted or deleted using 
standard methodologies and the resultant recombinant 
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virus placed in conditions which normally support: 
growth of the wild type virus vector, to assess the 
effect of the manipulations. The pathogenicity of the 
virus may be further compared with that of the 
5 unmodified virus vector strain in animal models, in 
order to assess its level of attenuation. 

In the present invention, the virus vector may be 
vaccinia virus. The vaccinia virus may be attenuated 
or disabled so that it is unable to fully replicate 
10 and esablish an extensive infection of host cells. 

Vaccinia virus has been used extensively in the 
past for vaccination against smallpox, and its use 
worldwide has led to the complete eradication of the 
disease (Bhebehami, Microbiol. Rev,, 47, 455, 1983). 
15 During the World Health Organisation (WHO) campaign to 
eradicate smallpox, several different strains of 
vaccinia virus were used as vaccines. In 1984 a 
meeting was sponsored by the WHO to discuss the use of 
vaccinia virus as live virus vectors (Bulletin of the 
20 WHO 63( 3 ): 471-477 ) . The data in this report indicates 
that the number of complications associated with 
vaccination was lowest for the Wyeth strain of 
vaccinia virus, and so this strain has been chosen as 
a basis for the construction of the recombinant virus 
25 according to an embodiment of the present invention. 

It is well known that insertion of foreign DNA 
into the genome of vaccinia virus at certain favoured 
sites, such as the thymidine kinase gene locus, can 
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reduce dramatically the ability of the virus to 
replicate in vivo. As discussed above, the aim of the 
therapeutic approach described here is to generate an 
active in vivo infection, so that a cellular immune 
5 response may be mounted against the virus encoded 
proteins . The present invention provides a method for 
inserting foreign genes at neutral sites within the 
genome of a virus, the disruption of which sites by 
the insertion will not interfere with and therefore 
10 have a substantially adverse affect on virus 
replication . 

Where the virus is vaccinia virus, the neutral 
site may be identified herein within the Wyeth strain 
of vaccinia virus on the basis of the related WR 

15 strain nucleotide sequence. Alternatively, where 
other vaccinia virus strains are used, sites 
equivalent to those sites identified above may be 
used. The neutral sites may be any as identified 
hereinafter as A,B,C and D, or a functional 

20 equivalent. 

For successful expression of foreign proteins by 
the recombinant virus vector the foreign genes must be 
placed under the control of a promoter sequence which 
is operable by the virus. Thus the recombinant virus 

25 vector may comprise a single promoter which controls 
the expression of all the heterologous genetic 
sequences within a single open reading frame. 
Alternatively, where the recombinant virus vector 
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encodes more than one open reading frame containing 
heterologous genetic sequences, the virus may comprise 
a first promoter which controls the expression of the 
genetic sequences from a first open reading frame, and 
5 one or more further promoters which control the 
expression of the genetic sequences from one or more 
further open reading frames. The promoter sequence 
may be virus- specific and several have been 
characterised so far (Davison & Moss, J. Mol* Biol,, 

10 210, 749, 1989; Davison & Moss, J. Mol, Biol., 210, 
771, 1989). The single promoter and the first and one 
or more further promoters may be the p 7.5 promoter . 
There have been reports that the induction of foreign 
antigen-specific CTLs requires expression of the 

15 antigen early in the virus replication cycle ( Coupar 
etal., Eur. J. Immunol., 16, 1479, 1986). Therefore, 
a recombinant virus as provided by the present 
invention may involve the use of the p7.5 promoter 
(Venkatesan et al.. Cell, 125, 805, 1981) and/or the 

20 H6 promoter (Rosel et ai., J. Virol, 60, 436, 1988), 
both of which are active both early and late in 
infection . 

As mentioned earlier, it has been reported that 
the E7 gene on its own has the potential to 
25 immortalise cells (Phelps et al . , Cell 53, 539, 1988). 
In an embodiment of the present invention, the 
strategy for expression of the protein involves 
production of E7 as a fusion protein with E6, which is 
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unlikely to retain biological function. Embodiments of 
the invention provide for reducing this risk still 
further, by making changes within the E7 gene which 
are known to destroy its oncogenic capacity ( Chesters 
5 et al., J. Gen Virol. 71, 449. 1990). Thus in the 
recombinant virus vectors of the present invention, 
the genetic sequences encoding part or all of the E7 
proteins may be altered from the equivalent wild type 
sequences, in order to render the sequences, used in 

10 the recombinant virus vectors less oncogenic than 
their equivalent wild type sequences. 

The present invention also provides 
pharmaceuticals comprising recombinant virus vectors 
as herein defined. The pharmaceutical may be for use 

15 against a condition caused by HPV infection which 
comprises an immunotherapeutically effective amount of 
a recombinant virus vector. The pharmaceutical may be 
for use against cervical cancer . 

The pharmaceutical may be a vaccine to immunise 

20 against a condition caused by HPV infection which 
comprises an amount of recombinant virus vector as 
herein provided which when administered to a recipient 
can specifically activate cells of the immune system 
to HPV proteins. The vaccine may be for iimnunisation 

25 against ceirvical cancer. 

The pharmaceuticals may comprise one or more 
excipients . The present invention also provides 
methods of using the recombinant virus vectors as 
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herein defined to make medicaments for use as 
immunotherapeutics or vaccines against conditions 
thought to be caused by HPV infection. For example 
for the prophylaxis and treatment of cervical cancer • 
5 The present invention also provides methods of 

treating mammalian patients with recombinant virus 
vectors and pharmaceuticals as herein provided. 

The present invention also provides a method of 
deteirmining a neutral site in a virus vector, the 
10 disruption of which by the insertion of heterologous 
gene sequences will not interfere with, and therefore, 
have a substantially adverse affect on viral function 
which relates to the replicative ability of the virus. 
The method for this determination comprises: (a) 
15 analysing a viral genome to identify open reading 
frames which are likely to encode functional genes, 
by looking for expected codon usage between spaced 
apart start and stop codons; and (b) selecting sites 
which are not in such open reading frames, likely to 
20 encode functional genes, as identified in (a) This may 
include selecting sites between open reading frames 
for sequences of functional genes and selecting sites 
which are in open reading frames which have some 
functional gene characteristics, such as an expected 
25 codon usage, but have lost other essential 
characteristics such as a start codon. The method may 
also comprise interrupting or deleting the selected 
sites from the viral genome and placing the resultant 
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virus in conditions which normally support growth of 
the wild type virus. 

The present invention also provides neutral sites 
identified by use of the above methods. 
5 The present invention provides an embodiment 

which shows a way of inducing a cellular immune 
response against the papillomavirus proteins usually 
expressed in cervical tumour cells by the creation of 
a recombinant vaccinia virus, which has been 

10 engineered to produce the HPV E6 and E7 proteins, or 
proteins containing HPV E6 and E7 sequences, during 
its replication cycle- This therapeutic vaccinia 
virus contains the E6 and E7 genes from both HPV16 and 
HPV18, the viruses most commonly associated with 

15 cervical carcinoma. Vaccination with this single 
virus may thus stimulate immunity to the E6 and E7 
proteins of the HPV types associated with more than 
80% of cervical tumours. Expression of all four gene 
sequences (e.g. HPV16 E6 and E7; HPV18 E6 and E7 ) in 

20 a single virus however presents a problem, because of 
the likelihood of loss of genetic sequences through 
recombination. The present invention provides a 
method for circumventing this difficulty, firstly 
through specific sequence alteration, in order to 

25 reduce sequence homology and secondly through their 
insertion into the vaccinia virus genome in such a way 
that if such recombination were to occur, it would not 
lead to loss of sequences (i.e. in inverted 
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orientation with respect to each other). Expression 
of the desired four gene sequences in the vaccinia 
virus genome could also be difficult (though not 
impossible) to achieve as independent expression 
5 units, and so the invention provides that instead, the 
E6 and E7 open reading frames may be fused together. 
A problem with standard methods for insertion of 
foreign information into the vaccinia virus genome is 
that the use of selectable markers to increase the 

10 efficiency of recombination results in the ultimate 
presence in the recombinant virus also of the 
selectable marker gene itself. Methods for insertion 
have been developed however, which allow subsequent 
elimination of these extraneous sequences ( Falkner & 

15 Moss J. Virol., 64, 3108, 1990) and these are used in 
an embodiment of the present invention to ensure that 
the final recombinant vaccinia virus has only those 
additional sequences which are necessary for its 
required function . 

20 In order that the present invention is more fully 

understood an embodiment will now be described in more 
detail with reference to the figures in which: 

Figure 1(a) shows the nucleotide sequence and 
three-frame translation of HPV16 E6/E7 polymerase 

25 chain reaction product ( underlined regions indicate 
the E6 and E7 coding sequences); Figure 1(b) shows the 
nucleotide sequence and three- frame translation of 
HPV18 E6/E7 polymerase chain reaction product 
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(underlined regions indicate the E6 and E7 coding 
sequences ) ; 

Figure 2 shows the cloning and modification of the 
HPV16 and HPV18 E6 and E7 genes; 
5 Figure 3 shows an open reading frame plot of vaccinia 
virus from positions 17201-18450 of the region covered 
by the four fragments SalF,G,H and I; short vertical 
lines denote termination codons, lines topped with 
boxes denote initiation codons, rectangles show 

10 relevant open reading frames, and arrows show 
direction of upper and lower DNA strands; 
Figure 4 shows an open reading frame plot of vaccinia 
virus from positions 21001-22000 of the region covered 
by the four fragments SalF,G,H and I; short vertical 

15 lines denote termination codons, lines topped with 
boxes denote initiation codons, rectangles show 
relevant open reading frames and arrows show direction 
of upper and lower DNA strands; 

Figure 5 shows an open reading frame plot of vaccinia 
20 virus from positions 23501-25000 of the region covered 
by the four fragments SalF,G,H and I; short vertical 
lines denote termination codons , lines topped with 
boxes denote initiations codons, rectangles show 
relevant open reading frames and arrows show the 
25 direction of upper and lower strands of DNA; 

Figure 6 shows a codon usage plot of vaccinia virus 
from positions 17201-18450 of the region covered by 
the four fragments SalF, G,H and I; arrows show 
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direction of each DNA strands- 
Figure 7 shows a codon usage plot of vaccinia virus 
from positions 21001-22000 of the region covered by 
the four fragments SalF,G,H and I; arrows show 
5 direction of each DNA strand; 

Figure 8 shows a codon usage plot of vaccinia virus 
from positions 23501-25000 of the region covered by 
the four fragments SalF,G,H and I; arrows show 
direction of each DNA strand; 
10 Figure 9 shows the DNA sequence around site A showing 
translations in single letter amino acid code of genes 
SalF 17R and SalF 19R; 

Figure 10 shows the DNA sequence around site B showing 
translations in single letter amino acid code of genes 
15 SalF 20R and SalF20.5R; 

Figure 11 shows a comparison of the SalG2R open 
reading frame to the yeast guanylate kinase gene 
sequence; 

Figure 12 shows the DNA sequence around site D showing 
20 translations in single letter amino acid code of genes 
HindB3R and Hind B4R; 

Figure 13 shows the cloning of vaccinia virus ( Wyeth 
strain) neutral sites; 

Figure 14 shows the cloning of vaccinia virus promoter 
25 sequences; 

Figure 15 shows the construction of vaccinia 
promoter-driven E6-7 cassettes- 
Figure 16 shows the cloning of the E6-7 cassette into 
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vaccinia virus (Wyeth strain) neutral sites; 
Figure 17 is a diagram showing the recombination 
required to generate the final therapeutic vaccinia 
virus - HPV recombinant virus; 
5 Figure 18 shows the synthetic oligonucleotides used in 
the construction of the therapeutic vaccinia virus HPV 
recombinant ; 

Figure 19 shows the nucleotide sequence of vaccinia 
virus (WR strain) from positions 17201-18450 of the 
10 region covered by the four fragments SalF,G,H and I; 
Figure 20 shows the nucleotide sequence of vaccinia 
virus (WR strain) from positions 21001-22000 of the 
region covered by the four fragments SalF,G,H and I; 
and 

15 Figure 21 shows the nucleotide sequence of vaccinia 
virus (WR strain) from positions 23501-25000 of the 
region covered by the four fragments SalF,G,H and I. 
Figure 26 shows a variety of options for arrangment of 
HPV16E6 and E7 and HPV18E6 and E7 coding sequences in 

20 a recombinant virus vector. 

All cloning procedures are carried out according 
to the protocols described in "Molecular Cloning*', A 
Laboratory Manual, eds. Sambrook, Fritsch and 
Maniatis, Cold Spring Harbor Laboratory Press, 1989. 

25 All plasmids on which site directed mutagenesis is 
performed are of the "phagemid" type, which may be 
converted to single-stranded DNA by superinfection 
with the bacteriophage f 1 . Preparation and site 
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directed mutagenesis of single- stranded DNA, is 
carried out as described by Brierley et al . , Cell, 57, 
537, 1989. The sequence of all the synthetic 
oligonucleotides used are provided in figure 18. 

5 Preparation of the E6 and E7 genes from HPV16 and 
HPV18 for insertion into vaccinia virus 
Cloning of the HPV16 and HPV18 E6 and E7 
A fragment of DNA containing the HPV16 E6/7 coding 
region is prepared by polymerase chain reaction (PCR) 

10 amplification from the plasmid pBR322/HPV16 (Durst et 
al., PNAS, 80, 3812, 1983) using the oligonucleotides 
SOS and S06. A fragment containing the same region 
from HPV18 is prepared by the same procedure from 
plasmid pBR322/HPV18 (Boshart et al . , EMBO J. 3,1151) 

15 using the oligonucleotides SOI and S02 . Plasmids 
PBR322/HPV16 and pBR322/HPV18 are both available from 
Behringwerke AG, P-0. Box 1140 D-3550, Marburg, 
Germany (alternatively the necessary sequences can be 
created synthetically from the sequence information 

20 provided by the present application). 

In each case, this produces a DNA fragment of 
about 800 base pairs (bp) with a site for the 
restriction enzyme Nco 1 ( CCATGG ) located exactly at 
the beginning of the E6 gene, and a Smal site 

25 immediately downstream of the termination codon for 
the E7 gene ( figure 1(a) and ( b ) ) . The products are 
then digested with Ncol and Smal, and cloned into 
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Ncol-Smal digested plasmid pUCllSNS (a modified 
version of the "phagemid" pUCllS { Viera & Messing, 
Methods Enzymol,, 153,3, 1987) in which Ncol and Smal 
sites have been created by site-directed mutagenesis 
5 within the poly-linker region) to generate the plasmid 
plMS7, containing the HPV16 sequences, and pIMS8 
containing the HPV18 sequences (figure 2). The use of 
pUCllS is not crucial to the present strategy as any 
plasmid which can be manipulated by site directed 
10 mutagenesis can be successfully used. 

Fusion of the E6 and E7 QRFs 

For insertion into vaccinia virus, the E6 and E7 genes 
from each HPV type, are first fused together to form 
a single continuous ORF. This is achieved by 

15 site-directed mutagenesis as follows: 

(i) The termination codon TAA of HPV16 E5 in pIMS7 is 
altered using the oligonucleotide S20 to the sequence 
GGAA. This is in order to convert the normally 
separate ORFs for HPV16 E6 and E7 into a single ORF 

20 (PIMS7.1 - figure 2). 

( ii ) The termination codon TAA of HPV18 E6 in pIMS8 is 
altered using the oligonucleotide S21 to the sequence 
GGAA. This is in order to convert the normally 
separate ORFs for HPV18 E6 and E7 into a single ORF 

25 (pIMSS.l - figure 2). 



Abolition of the immortalising potential of E7 
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In order to destroy the immortalising properties of 
each of the E7 proteins, two key codons within the 
HPV16 E7 coding sequence, (cys24 and glu26 - figure 
1(a)) and the equivalent codons from HPV18 E7 (cys27 
5 and glu29 - figure Kb)), are altered to glycine 
residues by site directed mutagenesis as follows. 

(i) The sequence of the E7 gene is altered in pIMS7 
to encode glycine at codons 24 and 26 (normally 
encoding cysteine and glutamate respectively, using 

10 oligonucleotide S22 (pIMS7.2 - figure 2). 

(ii) The sequence of the E7 gene is altered in pIMS8 
to encode glycine at codons 27 and 29 (normally 
encoding cysteine and glutamate respectively, using 
oligonucleotide S23 (pIMSS.lB - figure 2). 

15 Reduction in intertypic recombination potential of 
HPV16 and HPV18 E6 and E7 sequence and elimination of 
potential vaccinia virus transcription termination 
signal 

A potential difficulty with the presence of both HPV16 
20 and HPVIB E6 and E7 specific DNA within the genome of 
a single virus, is that recombination between the two 
sets of related sequences could lead to loss or 
rearrangement of information such that expression of 
the required proteins is disrupted. The invention 
25 provides ways of minimising this risk. Firstly, by 
inserting the two sets of genetic information in the 
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vaccinia genome in opposite orientation to each other 
(so that recombination will result not in the loss of 
sequence information, but rather in its inversion). 
Secondly, by creating specific changes in the E6/7 
5 sequence of one of the HPV virus strains at sites 
where the homology is greatest. These changes however 
are made in such a way that the amino acid coding 
potential of the genes remains unaltered- 

The HPV18 E6 sequences is therefore altered by 
10 site-directed mutagenesis as follows: 

The sequence TTTTTATTCTAGAATTAGAG ( which begins 
210 nucleotides from the start of E6 - underlined 
in figure Kb)) is mutated, using oligonucleotide 
S24 to the sequence TTTCTACAGTAGAATCAGAG (pIMS8.2 
15 - figure 2) (changed nucleotides are in bold 

type) . 

A second aim of this change is to eliminate from 
the HPV18 E6 sequence, the sequence TTTTTAT, which is 
a potential termination signal for the early vaccinia 
20 virus transcription enzyme ( Rohrmann et al . , Cell., , 46, 
1029, 1986). 

Source and propagation of vaccinia virus 
The Wyeth strain of vaccinia virus is used for 
construction of the therapeutic virus. It is propagated 
25 in Vero cells for the purposes of genetic manipulation, 
and in the human diploid fibroblast cell line MRC5 for 
the production of the final therapeutic virus stock. 
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Both cell lines are obtained from the National Institute 
of Biological Standards and Control, South Miras, U.K. . 
The Wyeth strain of vaccinia virus, vero cells and the 
cell line MRC5 are also available from the American Type 
5 Culture Collection, 12301 Parklawn Drive, Rockville, 
Maryland 20852, USA. 

Identification and cloning of neutral sites from the 

vaccinia virus 

Description of neutral sites 
10 For the purpose of insertion of papillomavirus genes 

within the vaccinia virus genome, sites have been chosen 

to have two characteristics. 

Firstly, they should be non-essential regions, i.e. 

insertion of foreign genes at these points, will not 
15 disrupt any functions of the vaccinia virus to the 

extent that the virus can no longer grow in tissue 

culture. 

Secondly, they should be neutral sites, i.e. 
insertion of foreign genes all these points , will not 

20 increase or decrease the level of attenuation of the 
vaccinia virus . 

The difference between these two factors can be 
seen by looking at the thymidine kinase ( TK ) gene of 
vaccinia virus. It is a non-essential region, and hence 

25 viruses with genes inserted in the TK gene can grow well 
in tissue culture ( Mackett at al . , J- Virol., 49, 857, 
1984). However, such viruses have been found to be 
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greatly attenuated in vivo (Duller et al . , Nature 317, 
813, 1985), For the purpose of prophylactic 

vaccination, such increased attenuation might be 
desirable. However, for an immunotherapeutic strategy 
5 where the danger from the disease to be treated clearly 
outweighs the risk of vaccine associated complications, 
use of an attenuated virus is considered undesirable 
since it could compromise the immunological response to 
the papillomavirus antigens. Hence, the applicants have 
10 identified sites which they judge will not attenuate the 
virus any further, and have termed them * neutral sites* . 
Such sites have been identified within the virus genome 
by careful analysis of the DNA sequence of the WR 
strain. The WR strain was originally derived from the 
15 Wyeth strain by passage in mouse brain. Therefore the 
two strains are closely related. The nucleotide 
sequence of three regions of the WR genome which contain 
the selected neutral sites are shown in figures 19,20 
and 21. Four neutral sites (A-D) have been chosen on 
20 the above discussed criteria as follows: 
Site A: gap between SalF17R and SalF19R 
Site B: gap between SalF19R and SalF20.5R 
Site C: in SalG2R, a potential non-functional gene 
Site D: in HindB3.5R, a potential non-functional gene 
25 These sites (A-D) can be identified by the following 
stretches of DNA sequence, each of which is 40 
nucleotides in length. 

A CTATCTACCAGATTATTATGTGTTATAAGGTACTTTTTCT 
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B TATTGTGCTACTGATTCTTCACAGACTGAAGATTGTTGAA 
C TCTCTTAAAATGGTTGAGACCAAGCTTCGTTGTAGAAACA 
D TGAGGCTACCTCGACATACGTGTGCGCTATCAAAGTGGAA 
In other strains these sequences may vary, while still 
5 having substantial homology with those given above. In 
particular a site may have at least 90%, more preferably 
95%, homology with the sequence given above. 
Figures 3-5 show the distribution of initiation codons 
and open reading frames (ORFs) in the regions of the 
10 vaccinia virus genome shown in figures 19,20 and 21. 
Figures 6-8 show the same regions with a plot showing 
to what extent each reading frame conforms to the 
pattern of codon usage expected for vaccinia genes. A 
graph of codon usage is plotted for each of the three 
15 possible reading frames in each direction (Staden, R., 
Nucl. Acids Res., 12, 521, 1984; Staden, R. , Nucl. Acids 
Res., 12, 551, 1984). In these codon usuage plots, the 
short vertical bar lines extending from the horizontal 
axes represent start codons. The longer vertical bar 
20 lines placed above the horizontal axes represent stop 
codons. This sort of plot is a useful way of helping to 
determine whether a particular ORF is a genuine vaccinia 
gene. Where there is a likely genuine gene, the graph 
of codon usuage will rise between a start codon and a 
25 stop codon. For example, in figure 7, it can be seen 
that the graph of codon usage rises over the region of 
the SalG2R ORF ( the dotted line shows that this frame 
conforms most of the expected codon usage). For the 
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other two frames the graphs show that they do not 
conform to vaccinia codon usage . The peak labelled 
*part of gk ' , and marked with a dashed line, also 
conforms well to vaccinia codon usage. In summary, a 
5 genuine gene must start with an initiation (start) 
codon, end with a termination (stop) codon, and should 
conform well to vaccinia codon usage along its length. 
In most cases the conformation to the vaccinia codon 
usage drops off sharply outside the gene. 
10 The neutral sites are further described as follows 
Site A. Gap between SalF17R and SalF19R 

Site A is marked on figures 3 and 6. Figure 9 shows the 
actual DNA sequence with a translation of the ORFs on 
either side of the site. It can be seen that Site A is 

15 placed in an intergenic region between SalF17R and 
SalF19R. It is placed some 195 bases upstream of 
SalF19R to avoid any promoter elements associated with 
that gene. The sequence TTTTTCT (shown in italics) will 
act as a tei*minator of early RNA transcription for the 

20 SalF17R gene if it is an early gene. However, the site 
is placed downstream of the first of these, so it will 
not affect early termination of transcription if it 
occurs. Examination of figure 6 shows that there is no 
recognisable gene on the opposite strand at this point, 

25 and hence this sequence location is suitable as a 
neutral insertion site. 

Site B, Gap between SalF19R and SalF20.5R 

Site B is marked on figures 3 and 6- Figure 10 shows 
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the actual DNA sequence with a translation of the ORFs 
on either side of the site. It can be seen that Site 



SalF20,5R. Figure 6 shows that it is within a region 
5 of high vaccinia codon usage, but that this region does 
not form a genuine gene, having no initiation codon. 
In addition, figure 6 suggests that Sal20-5R is not a 
complete gene, as the conformation to vaccinia codon 
usage drops off dramatically at the start of the gene. 
10 In the event that SalF20.5R is a genuine gene. Site B 
is placed some 70 bases upstream of SalF20.5R which may 
well avoid any promoter elements associated with that 



located in approximately 35 bases upstream of the start 
15 of the gene. ) In addition SalF20R has no TTTTTNT (N-any 
nucleotide) transcription termination signal with which 
Site B could interfere. Hence this sequence location 
is suitable as a neutral insertion site. 

Site C. Within SalG2R, a potential non-functional gene 
20 Site C is marked on figure 4. The ORF SalG2R has 
considerable similarity to the guanylate kinase ( GK ) 
gene of yeast- This similarity is shown in figure 11. 
Sequence upstream of the SalG2R ORF (but in a different 
frame) has been added on to SalG2R, to see if the match 
25 to GK extends beyond the boundaries of the original open 
reading frame. The match appears to extend beyond the 
5' end of the SalG2R ORF. In particular, an important 



B is placed in an intergenic region between SalF19R and 



gene. 



(Note: many vaccinia promoter elements are 
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site in the yeast GK gene, the ATP/GTP binding site 
( shovm underlined) only matches in the out of frame 
sequence upstream of the SalGZR ORF. Hence, it is very 
likely that the SalG2R gene is not active as a guanylate 
5 kinase and can be referred to as a *pseudogene* * If the 
gene is inactive as the applicants deduce, then it will 
serve as a neutral insertion site. 

Site D. Within HindB3>5R, a potential non- functional 
gene 

10 Figure 5 shows that site D lies within the region 
designated HindB3,5R. This region, although conforming 
to vaccinia codon usage, has no start codon and is 
therefore not a genuine gene. The codon usage plot 
shown in figure 8 indicates that it probably was once 

15 a functional gene, and may well have been attached to 
HindB3R (a shift in the codon usage preference occurs 
here well away from the termination codon of the HindBSR 
ORF which suggests that the last section of HindBSR is 
not properly part of this gene. ) Hence it is likely 

20 that HindB3.5R is not active as a gene and can be used 
as a neutral insertion site. Figure 12 shows the actual 
DNA sequence with a translation of the ORFs on either 
side of the site. It can be seen that site D is placed 
in an intergenic region between HindB3R and HindB4R as 

25 well as being within the non-functional HindB3.5R- 

Preparation of vector for cloning of neutral sites 

In order to insert foreign genetic information into the 
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neutral sites described above, DNA copies of the neutral 
sites, together with an appropriate amount of flanking 
DNA from the vaccinia genome (approximately 500 bases 
on either side) must first be cloned into a plasmid 
5 vector. These plasmids may then be used to introduce the 
foreign DNA into the vaccinia virus genome; the vaccinia 
virus ' flanking sequences * around the inserted gene 
serve to allow homologous recombination between the 
plasmid DNA and the viral DNA, with the consequent 
10 insertion of the foreign gene at the desired location. 



Cloning of neutral site sequences 

Plasmids containing flanking regions from the neutral 
sites are constructed as follows. DNA is prepared from 
the Wyeth strain of vaccinia virus by the method of 

15 Esposito et al., (J. Virol. Meth. 2: 175, 1981). The 
polymerase chain reaction (PGR) is used to remove an 
approximately 1000 base pair (bp) fragment from DNA of 
the Wyeth strain of vaccinia virus. Pairs of 

oligonucleotides are chosen approximately 500 bp either 

20 side of the chosen neutral site. These oligonucleotides 
are based on the sequence of the WR strain, but are 
chosen in regions where the sequence of the WR strain 
is identical to that of the Copenhagen strain ( Goebel 
et al.. Virology 179:247, 1990). The oligonucleotides 

25 incorporate restriction enzyme recognition sequences so 
that they can be cloned easily into a plasmid. For 
neutral sites A,B and D the restriction sites are EcoRl 
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and Hindlll. For neutral site C the Hindlll site is 
replaced by an Sphl site, since there is an internal 
Hindlll site in the chosen flanking sequences. 
The oligonucleotides used for PGR are listed below: 
5 Site A leftMB 16 

Site A rightMB 17 

Site B leftMB 24 

Site B rightMB 25 

Site C leftMB 18 
10 Site C rightMB 19 

Site D leftMB 22 

Site D rightMB 23 



DNA fragments of approximately Ikb are then prepared 
using these pairs of oligonucleotides by PGR 
15 amplification, digested with EcoRI and Hindlll (for site 
A,B and D) or with EcoRI and Sphl (for site C) and 
cloned into Hindlll and EcoRI -digested pUG118 (figure 
13) to generate the plasmids pIMMG7a, pIMMG7b, pIMMC7c 
and pIMMC7d. 

20 Creation of unique restriction sites for insertion at 
the neutral sites 

A suitable restriction enzyme site is then introduced 
at the selected location within each of the plasmids. 
This is achieved using site directed mutagenesis using 
25 an oligonucleotide conraining the desired new unique 
site and flanked by 15 bases of sequence to either side 
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(see below). The plasmids modified in -this fashion are 
designated pIMMC8a-d ( figure 13 ) . 



original oligonucleotidesite new plasmid 
plasraid introduced 



pIMMC7a MB35SnaBl pIMMCSa 

pIMMC7b MB36Hpal pIMMCb 

pIMMC7c MB37Stul pIMMCSc 

pIMMC7d MB38SnaBl pIMMCSd 



10 Cloning of the vaccinia virus early/late promoter 
sequences 

The p7 . 5 and H6 promoters from vaccinia virus genomic 
DNA are prepared by PGR amplification as described 
below. 

15 A pair of complementary oligonucleotides (S7 and S8 ) is 
synthesised to include the following restriction enzyme 
sites, Hindlll, SnaBl, Hpal, Hindlll, Sail, Ncol, Smal, 
SnaBl and EcoRl, such that the pair, after annealing, 
present at one end Hindlll compatible overhanging ends, 

20 and at the other, EcoRl compatible overhanging ends. 
The two oligonucleotides are allowed to anneal and are 
inserted into pUCllB cut with EcoRl and Hindlll (figure 
14). The resulting vector is called pIMMC3. 

A DNA molecule of approximately 180 bp containing 

25 the H5 promoter is removed from the WR strain of 
vaccinia virus by PGR amplification using the 
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oligonucleotides MB15 (anneals upstream and includes a 
5 '-Sail site) and MB7 (anneals downstream and includes 
a 5*-HindIII site). This is cloned into pIMMC3 cleaved 
with Hindlll and Sail to create plMMC4a (figure 14). 
5 A DNA molecule of approximately 200bp containing the 
p7,5 promoter is then removed from the WR strain of 
vaccinia virus by PGR amplification using the 
oligonucleotides MB32 (anneals upstream and includes a 
5 '-Sail site) and MB33 (anneals downstream and includes 
10 a 5'Ncol site). This is cloned into pIMMC3 cleaved with 
Ncol and Sail to create pIMMC14b. 
Construction of the therapeutic virus 

The strategy required to generate a recombinant vaccinia 
virus containing and expressing the E6-E7 proteins from 
15 HPV16 and HPV18, based on the elements described above 
involves five main stages as outlined below, 
i ) Cloning of the modified E6-7 genes downstream of 
vaccinia early promoter sequences 

A DNA fragment containing the modified HPV16 E6-7 
sequence is excised from pIMS7.2 by digestion with 
Hindu I and Smal, and cloned into Hindlll and 
Hpal-digested pIMMC4a to generate pIMS12 (figure 
15 ) . 

A DNA fragment containing the modified HPV18 E6-7 
sequence is excised from pIMS8,2 by digestion with 
Ncol and Smal, and cloned into Ncol and 
Smal-digested pINTMC14b to generate pIMS14 ( figure 
15 ) . 



20 



25 
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±± ) Preparat:±on of a plasmid vector cont:aininQ bo1:h 
HPV16 and HPV18 E6-7 sequences t:oge1:her with their 
upstream vaccinia promoters . 

A DNA fragment containing the HPV18 E6-7 region 
5 together with the upstream p7 . 5 promoter is 

excised from pIMS14 with Sail and Smal and 
inserted into Sail and Smal-digested pIMS12 to 
generate pIMSlS (figure 15) 

iii) Insertion of the HPV E6-7/promoter "double" 
10 cartridge into the neutral site containing 

plasmids , 

A DNA fragment containing both the HP VI 6 and HP VI 8 
E6-7 coding regions together with their upstream 
promoter elements is excised from pIMSlS with 
15 SnaBl and inserted into the appropriately-digested 

neutral site-containing plasmids pIMMC7a-d. This 
step is shown in figure 16, and the resulting 
plasmids are designated pIMMC9a-d- 

iv) Introduction via homologous recombination of the 
20 neutral site DNA, together with the intervening HPV 

sequences, into the vaccinia virus genome to create 
a recombinant virus expressing the two modified HPV 
E6-7 sequences , 

The recombinant plasmids pIMMC9a-d are purified 
25 and allowed to recombine into vaccinia (figure 17) 

using standard protocols ( Mackett et al . , in D.M. 
Glover ( ed ) DNA Cloning: a Practical Approach, 
Oxford and Washingron DC, IRL Press, 1985). 
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Viruses which have acquired the HPV sequences are 
identified by probing with radiolabelled HPV 
specific sequences. Viral plaques are lifted onto 
nitrocellulose (Villareal and Berg, Science 196, 
5 183, 1977) and probed with radiolabelled Ncol-Sraal 

fragment from pIMS14 containing the HPV18 E67 
gene. Recombinant viruses are then isolated from 
the agarose overlay and plaque purified three 
times. They are checked for the presence of the 
10 appropriate DNA sequences by Southern blotting of 

purified virus DNA using DNA probes derived from 
the HPV E6 and E7 genes, and for expression of the 
appropriate sequences by western blotting using 
antisera specific for the HPV E6 and E7 proteins, 

15 Cloning of the therapeutic virus in MRC5 cells 

Stocks of the final recombinant virus are prepared 
by growth in Vero cells, and are used to infect 
MRC5 cells deemed suitable for the preparation of 
material suitable for use as human vaccines. The 

20 virus is plaque-purified three times by standard 

methods, and finally a stock prepared for clinical 
use . 

Confirmation of presence of the correct HPV DNA 
insert 

25 A sample of this stock virus is checked once again 

for the presence of correctly configured virus 
DNA, and for expression of the correct virus 
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proteins. Figure 22 shows the analysis by PCR, of 
a recombinant vaccinia virus (v9a.l) in which the 
HPV DNA cassette is inserted at Site A. The 
diagram shown in panel ( a ) indicates the DNA 
5 fragments expected if insertion of the correct DNA 

has occurred. It can be seen in panel (b) that 
the actual pattern of PGR products generated is 
consistent with that expected. 



Confirmation of expression of the HPV DNA insert 

10 The recombinant viruses are then checked for 

expression of the expected HPV proteins. An 
example- of this analysis is shown in Figure 23. 
Vero cells are infected with with recombinant 
virus v9a.l (HPV DNA inserted at Site A), and the 

15 cells examined by western blotting for the 

presence of the HPV E67 fusions proteins using 
monoclonal antibodies specific for the HPV16 E7 
protein (camvirS) and for the HPV18 E7 protein 
( 7E10 ) • It can be seen that both monoclonal 

20 antibodies recognise specifically proteins of the 

expected size in cells infected with the 
recombinant virus v9a.l, but not in cells infected 
with the control parent virus Wyeth strain. These 
recognised proteins co-migrate with proteins 

25 synthesised by in vitro translation of mRNA 

encoding the expected HPV fusion proteins (HPV 15 
E67 and HPV18 E57 ) . This experiment indicates 
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successful expression of the heterologous gene 
sequences from the recombinant virus, 
Stability of the HPV DNA insert 

For the recombinant virus to be of use clinically, 
5 it is important that the inserted sequences remain 

genetically stable over multiple virus passage, 
and the DNA insert was carefully designed to 
promote this genetic instability. To confirm the 
stability of the HPV information within the 

10 recombinant virus genome, the virus is subjected 

to 9 serial passages, (multiplicity of infection 
=10pfu/cell) in Vero cells. Subsequently 20 
plaque isolates are picked, and analysed for the 
presence of the correct HPV DNA insert by PGR 

15 analysis as described in Figure 22. The data 

obtained for recombinant virus A are shown in 
Figure 24. All 20 virus isolates retained the HPV 
information in the expected genetic arrangement 
indicating a considerable degree of genetic 

20 stability. 



Animal experiments 

The virulence of the recombinant virus is compared 
in animal experiments with that of the parental 
Wyeth strain. Groups of 20 mice are inoculated 
25 intranasally each with 10^ pfu of Wyeth strain or 

recombinant virus in a total volume of 20ij1 . Two 
mice are sacrificed at 1 day, 3 days and 5days 
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following inoculation, and the lungs dissected 
out. The amount of virus present in the lungs is 
then measured by grinding the tissue, and assay of 
the homogenate by standard vaccinia virus plaque 
5 assay- The results of such an experiment for the 

recombinant virus v9a.l (HPV infromation inserted 
at site A) are shovm in Figure 25, It can be seen 
that the recombinant virus retains the ability to 
replicate in mice, and that the level of virus 
10 produced in the lungs of the infected animals is 

similar to that seen with the parental Wyeth 
strain. 



Therapeutic Use 

A stock of the recombinant virus is prepared by 
15 infection of MRC5 cells, and adjusted to a 

concentration of not less than 10® pfu/ml. 20 \il 
of this material is applied to the arm of the 
patient, which is then scarified through the virus 
droplet with a bifurcated needle, according to the 
20 standard procedure used for vaccination against 

smallpox . 
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CLAIMS 

1 . A recombinant virus vector for use as an 
immunotherapeutic or vaccine which comprises at least 
one pair of nucleotide sequences heterologous to said 
5 virus which have sufficient sequence homology that 
recombination between them might be expected; 

wherein said pair of nucleotide sequences are 
arranged in said virus vector such that they are 
inverted with respect to each other to reduce the 
10 likelihood of recombination events leading to loss of 
part or all of said sequences and said virus vector is 
able to infect a mammalian host cell and express as 
polypeptide the heterologous nucleotide sequences in 
said host cell. 

15 2. A recombinant virus vector according to claim 1 
wherein at least one pair of nucleotide sequences 
encode part or all of human papillomavirus ( HPV ) wild- 
type proteins or mutant proteins immunologically 
cross-reactive with said wild-type proteins. 

20 3. A recombinant virus vector according to claim 1 
or claim 2 wherein the pair of nucleotide sequences 
encode part or all of the HPV wild-type proteins 
HPV16E7 and HPV18E7 or mutant proteins immunologically 
cross-reactive therewith . 
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4 . A recombinant: virus vector according to claim 1 
or claim 2 wherein the pair of nucleotide sequences 
encode part or all of the HPV wild-type proteins 
HPV16E6 and HPV18E6 or mutant proteins immunologically 

5 cross-reactive therewith. 

5 . A recombinant virus vector according to claim 3 
which comprises a further pair of nucleotide sequences 
which encode part or all of the HPV wild-type proteins 
HPV16E6 and HPV18E6 or mutant proteins immunologically 

10 cross-reactive therewith. 

5. A recombinant virus vector according to any one 
of the preceding claims wherein two or more nucleotide 
sequences of different said pairs may be fused 
together to form a single open reading frame. 

15 7. A recombinant virus vector according to claim 6 
wherein the fusions are via a single codon encoding a 
small neutral amino acid. 

8 . A recombinant virus vector according to claim 7 
wherein the amino acid is glycine. 

20 9. A recombinant virus vector according to claim 5 
in which the pairs of nucleotide sequences are 
arranged in the virus vecror according to any one of 
the options as shown in Figure 26. 
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10- A recombinant: virus vector according to claim 9 
which comprises 

a first open reading frame having a fused genetic 
sequence encoding part or all of the wild-type 
5 proteins E6 and E7 from HPV16; and 

a separate second open reading frame having a 
fused genetic sequence encoding part or all of the 
wild- type proteins E6 and E7 from HPV18; 

wherein the first and second open reading frames 
10 may be inverted with respect to one another whereby 
either: 

i) the E6 coding sequences of HPV16 and HPV18 
are both located between the E7 coding sequences of 
HPV16 and HPV18; or 
15 ii) the E7 coding sequences of HPV16 and HPV18 

are both located between the E6 coding sequences of 
HPV16 and HPV18; and 

wherein any of said wild-type proteins may be 
replaced by a mutant protein immunologically cross- 
20 reactive therewith. 

11. A recombinant virus vector according to claim 10 
wherein each of the first and second open reading 
frames has a corresponding promoter and the two open 
reading frames each with its promoter, are arranged 

25 next to each other in the virus. 

12. A recombinant virus vector according to claim 11 
wherein either : 
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i ) the promoters are located between the first 
and second open reading frames whereby the open 
reading frames are transcribed outwardly; or 

ii ) the promoters are located outside the first 
5 and second open reading frames whereby the open 

reading frames are transcribed inwardly. 

13. A recombinant virus vector according to any one 
of claims 9 to 12 which comprises 

a first open reading frame having a fused genetic 
10 sequence encoding part or all of the wild-type 
proteins E6 and E7 from HPV16; and 

a separate second open reading frame having a 
fused genetic sequence encoding part or all of the 
wild-type proteins E6 and E7 from HPV18; 
15 wherein the E6 coding sequences of HPV16 and 

HPV18 are both located between the E7 coding sequences 
of HPV16 and HPV18; and 

each open reading frame has a corresponding 
promoter, the promoters being located between the 
20 first and second open reading frames whereby the open 
reading frames are transcribed outwardly; and 

wherein any of said wild-type proteins may be 
replaced by a mutant protein immunologically cross- 
reactive therewith , 



25 



14, A recombinant virus vector according to any one 
of the preceding claims wherein either or both of the 
nucleotide sequences in a said pair of nucleotide 
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sequences are altered to make them less homologous 
than an equivalent pair of nucleotide sequences 
encoding wild-type HPV proteins. 

15. A recombinant virus vector according to claim 14 
5 wherein the alteration in nucleotide sequence does not 

result in an alteration of the encoded amino acid 
sequence . 

16. A recombinant virus vector according to claim 3 
wherein the wild-type proteins HPV16E7 and HPV18E7 are 

10 replaced with mutant proteins which are substantially 
homologous to said wild-type proteins and in which the 
residues cys 24 and glu 26 of wild-type protein 
HPV16E7 and the residues cys 27 and glu 29 of wild- 
type protein HPV18E7 are replaced with glycine 

15 residues. 

17- A recombinant virus vector according to any one 
of the preceding claims wherein said heterologous 
nucleotide sequences may comprise part or all of the 
sequences shown in Figures 1(a) and 1(b). 

20 18. A recombinant virus vector according to any one 
of the preceding claims which is derivable from 
vaccinia virus. 

19. A recombinant virus vector according to any one 
of the preceding claims wherein the nucleotide 
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sequences are inserted into the virus vector at one or 
more neutral sites, the disruption of which by the 
insertion of the nucleotide sequences does not 
substantially adversely affect viral functions 
5 relating to the replicative ability of the virus in 
the mammalian cell . 



20. A recombinant virus vector according to claim 19 
which is derivable from vaccinia virus and wherein the 
neutral sites may be one or more of: 
10 A) the gap between SalIF17R and SalIF19R of 

strain WR comprising at least part of the sequence 
CTATCTACCAGATTATTATGTGTTATAAGGTACTTTTTCT; 

B) the gap between SalIF19R and SalIF20,5R of 
strain WR comprising at least part of the sequence 

1 5 TATTGTGCTACTGATTCTTCACAGACTGAAGATTGTTGAA ; 

C) a region in SalIG2R of strain WR comprising 
at least part of the sequence 

TCTCTTAAAATGGTTGAGACCAAGCTTCGTTGTAGAAACA; 

D) a region in HindB3 - 5R of strain WR comprising 

20 at least part of the sequence 

TGAGGCTACCTCGACATACGTGTGCGCTATCAAAGTGGAA; 

E) a sequence having at least 90% sequence 
homology to those sequences A) to D ) identified above. 



21- A method for making a recombinant virus vector 
25 according to claim 19 or claim 20 which comprises 
inserting a said heterologous nucleotide sequences into 
one or more neutral sites in a virus vector, the 
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disruption of such a site by said insertion will not 
substantially adversely affect the replicative ability 
of the virus and wherein the neutral site has been 
previously identified by: (a) analysing a viral genome 
5 to identify open reading frames which are likely to 
encode functional genes; and (b) selecting sites between 
open reading frames for functional genes or sites within 
sequences for non- functional genes. 

22. A recombinant virus vector obtainable by the method 
10 of claim 21. 

23. A method which comprises using a recombinant virus 
vector according to any one of claims 1 to 20 or to 
claim 22 to manufacture a medicament for use as an 
immunotherapeutic or vaccine against a condition thought 

15 to be caused by HPV infection, for example cervical 
cancer . 

24. A method which comprises using a recombinant virus 
vector according to any one of claims 1 to 20 or to 
claim 22 to specifically activate cells of the immune 

20 system to HPV proteins. 
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Fig. la . 

AT C C ClrtTGGAC CAAAA GA GAAC TGCAATG TTT CAGGA C C CACAGGAGC 

I PWTKRELCCFRT HRSDP£S 
SHGPKENCNVSGPTGATQKV 

? M c J K R T A r 0 r p o K F. ? p. y. : 
" Starr of E6 coding region 



TACCACAGTTATGCACAGAGCTGCAAACAACTATACATGATATAATATTAGAATGTGTGT 12 0 

YHSYAQSCKQLYMI *Y*NVC 
r T V M H R A A N N Y T * Y N I R M C V 
r Q L Z I Z L Q I I I H - T t F r y v 

ACTGCAAGCAACAGTTACTGCGACGTGAGGTArATGACTTTGCTTTTCGGGATTTAT ISC 

TASNSYCDVRYMrLLFG lYA 
IQATVTAT-GI^LCFSGFMH 
C y. O Q L I, ?. P. Z '/ Y r F A F P D I_ " T 

TAGTATATAGAGATGGGAATCCATATGCTGTATGTGATAAATGTrrAAAGTTTTATTCTA 2 4 0 

*YIEMGIHMLYVINV*SFIL 
SI*RWESICCM**MFKVLF* 

V Y R D G N P Y A, C D K C L K F Y ? K 

aaattagtgagtatagacattattgttatagrttgtatggaacaacattagaacagca^ 30 0 

k l v s i d i i v i v c m e q h * n s n 
n^'-^v^tili-f^'wnnirta: 

^ ^ t E H Y T Y F L Y - - :. F - V 

ACAACAAACCGTTGTGTGATTTGTTAATTAGGTGTATTAACTGTCAAAAGCCACTGTG'rC 3 60 

TTNRCVIC*LGVLTVKSHCV 
Q Q T V V * F V N * V Y * L S K A T V £ 

>i K P 1 C D L L I R G r 2 O y P L ^ F 

'-^ - oj'iAoAAAAGCAAAGACATCTGGAGAAAAAGCAAAGATTCCATAATATAAGGGGTC 4 2 C 



^'^irv^v^ i. v^oj-v ^ GTA GTGTTGTTGGjnG-'-iT'GATCAAGAACACGTAGAGAAACG 2AGCTG i 4 30 

-• ? ■-■ 0 V : L A ; H c E H •-• E K, ? £ : 
- ?. £ M : V 1 1 ; I I K ^; 7 • R t; ? A ■.' 

— — £ — : — :j — ^ — :: — : — s — ^ — s e i e e i t ~ i • 
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Fig. 1a (Con f J. 



AAx CA 1 GCATGGAGATACACCTACATTGCATGAATATATGTTAGATTTGCAACCAGlAGAC 5 4 c 

NHAWRv-T-vT.:;,* lYVRFATRD 

- a ^ U I F T L H e: Y M r o p r - 

SCMEIHLHCMNIC* I C N Q R Q 
Star:: of E7 coding region 

AAC X J. CTCTACTGTTATGAGCAATTAAATGACAGGTCAGAGGAGGAGGATGAAATAGA 6 0 C 

N'SLLL*AIK*QLRGGG*r:R 
T r — L — V — c X E 2 L N D S S L Z E D F. i 

L I S T V M S N * M T A Q R R R M K • M 

TGGTCCAGCTGGACAAGCAGAACCGGACAGAGCCCATTACAATATTGTAACCTTTT^ 6 60 

WSSWTSRTGQSPLQYGNLLL 



-.5 EPDPAHYNTVTFr 



VQLDKQNRTEPITI L * P F'/A 

CAAGTGTGACTCTACGCTTCGGTTGTGCGTACAAAGCACACACGTAGACATTCGTACTT^ 

QV*LYASVVRTKHTRRHSYr 
— ^ — C 2 2 I L R T. r V O .q H V n T R - I , 

SVTLRFGCAYKAHT*TFVI.W 

GGAAGACCTGTTAATGGGCACACTAGGAATTGTGTGCGCCATCTGTTCTCAGAAACCiATA 

GRPVNGHTRN'CVPHLFSETI 
^ 2 L lMr;-Lr:-V-:D-rc;oyo . 

KrC*'WAH*FLCA?SVLRNHN 

'GA 
T R V 
P G * 
P G 
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Fig. lb 



ATCCCATGGCGCGCTTTGAGGATCCAACACGGCGACCCTACAAGCTACCTGATCTGrGCA 60 
IPWRALRIQHG3PTSYLICA 
SHGAL *GSNTATLQAT* SVH 

? A R F E D P T R R P Y K L P D 1. - - 

" Start of E6 cccixng region 

CGGAACTGAACACTTCACTGCAAGACATAGAAATAACCTGrGTATATTGCAAGACAGT^^ 1 2 C 
RN*TLHCKT*K*PVYIARQY 
GTEHFTARHRNNLCILQDSI 
£ L 1:1 T S L ODIEITCVYCKTVT. 

TGGAACTTACAGAGGTATrraAATTTGCATTTAAAGATTTAT^ 13 0 

W N L Q R Y L N 1 H L K I Y L W C I E T 
GTYRGI*IC:*RFICGV*RQ 
£ L I E )L E H F A F K r T. F V V Y R D ,S 

GrATACCCCATGCTGCATGCCATAAATGTATAGATTTTTATTCTAGAATTAGAGAATTAA 2 4 0 
VYPMLHAINV*IFILELEN* 
YTPCCMP*MYRFLF*N*RIK 
1 E a h h C H K ■ I D E Y S R T R F T. R 

GACATTATTCAGACTCTGTGTATGGAGACACATTGGAAAAACTAACTAACACTGGGTTAT 30 0 

DIIQTLCMETHWKN*LTLGY 
TLFRLCVWRHIGKTN ' HW^'I 
HYSDSVYGC TLEKLTNTGLY 

ACAATTTATTAATAAGGTGCCTGCGGTGCCAGAAACCGTTGAATCCAGCAGAAAAACTT^ 3 50 

TIY* *GACGARNR'IQQKNL 
QrlNKVPAVPETVESSRKT* 

H — L — L — : — £ L L E Z C i::^ £ L N p a F K r 

GACACCTTAATGAAAAACGACGATTTCACAACATAGCTGGGCACTATAGAGGCCAG^ 420 
DTIMKNDDFTT-LGTIEASA 
TP* * K T T : S C H S W A L * R P V ? 

a — L — tl E E B E Lj N I £^ ^- H Y R G r - h 

ATTCGTGCTGCAACCGAGCACGACAGGAACGACTCGAACGACGCAGAGAAACACAA^ 4 8 0 

I R A A T E H D R N D S r; D A E K H K Y 

F V L C p S T T G T T P T T Q R N T S I 
— E — :: E 1:1 E h 2 2 E E E C E B E Z - o v - 
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Fig.lb(Cont). 

AATATTAAGTATGCATGGACCTAAGGCAACATTGCAAGACATTGTATTGCATTTAGAGCC 5 4 0 

NIKYAWT*GNIARHCIAFRA 
- ^ S M H GFKATL Q^ I V L H L F P 

Y*VCMDLRQHCKTLYCI *S? 

^ Starr of £7 cooing region 

ccaaaatgaaattccggttgaccrrctatgtgacgagcaattaagcgactcagaggaaga 6 g 0 
?k*nsg*?smsra:krlrgr 
C Z : P V D L L c H e: 0 L P S K F f 

KMKFRLTFYVTSN*ATORKK 

AAACGATGAAATAGATGGAGTTAATCATCAACATTTACCAGCCCGACGAGCCGAACCACA 6 60 

KR*NRWS*SSTFTSPTSRTT 
U 2 FID G )L U a C d L parraepo 

TMK*MELIINIYQPDEPNHN 



ACGTCACACAATGTTGTGTATGTGTTGTAAGTGTGAAGCCAGAATTGAGCTAGTAGTAGA 7 2 0 

TSHNVVYVL*V*SQN*ASSR 

B a I U L C M C " K C F A R T F T. V V F 

VTQCCVCVVSVKPE LS * * * K 

AAGCTCAGCAGACGACCTTCGAGCATTCCAGCAGCTGTTTCTGA^ 7 8 0 

KLSRRPSSIPAAVSEHPVLC 

S 2 h DnL RAFOOLFLNTL^FV 

AQQTTFEHSSSCF*TPCPLC 

GTGTCCGTGGTGTGCATCCCAGCAGTAACCCGGGTGA S 4 0 

VSVVCIPAVTRV 
C ? W C A S Q O * P G * 
VRGVHPSSKPG 
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Fig. 3. 
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Fig A 
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Fig. 5. 
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Fig. 6. 
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Fig. 7. 
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Fig. 8. 
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Fig. 9. 

S a I F 1 7 R 

GflTTGTGTGCflTGCGTGGTflGflTGTTTGGflGflflflTGflGflflflCTGTTTTCTRGflTGGflfiflT 

LCflCUUDUURME^LFSRUKV 

RTTGTTTFICGflGCTflTTflflflCTGTTTflTTflflTGflTCRCflTGCTTGflTRflGflTflflflflTCTfl 

CLRfliKLFIHDHnLDICIKS! 

TRCTGCflGflflTflGfiCTRGTflTRTGTGGflaRTGTCRTRGflRflGTTflflflflGTTflflTGRGflGC 
LOHRLUVUEnS* 

RRRflflTflTRTRflGGTTGTflTTCCflTRTTTGTTflT rrrrrCrGTRRTRGTIflGRRflflflTRC 

Site fl 
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801 
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TGGRTRTRRRGflflTTTGCTGRCTGCRTGTRCTRTTTTTTRCflTTRCTRCRTTGGCTRCGG 98 1 

GV»:;EFRDCnVVFLHVV i GVG 
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Fig. 10. 



S a I F 2 OR 

TGCCflflGGTTflGflTGTflflTGGTflflCGfllfiflCflCflflflflTGTGflflCGCTGCCCflCCTCflTflC 1110 
FiiCURCHGMDHTKCERCPPHT 

flTRTflCCRCflflTCCCflflTTRTTCTRfllGGflTGTCflTCflflTGTflGBflfiflTGCCCflflCCGGfi 1 2QC 

VTTlPllLnDUIMUEHflOPD 

TCflTTTGRTflfiGGTflflflGTGTRCCGGflflCflCflGflflCflGTflflflTGTTCGTGTCTTCCTGGT 12 60 

H L ! R • 

Site B 

TGGTRTTGTGCTRCTGRTTCIimLflGflCTGflRGflTTGTTGflflflTTGTGTflCCflflflRflGG 1 320 

Sa 1 F20 . 5R 

RGflTGTCCRTGCGGflTflCTTTGGTGGflflTRGflTGRRCRflGGRRflTCCTRTTTGTRRfiTCG 13 80 

HMKEiLFUHR 

TGTTGTGTTGGTGflRTflTTGCGflCTRCCTRCGTfiflTTRTflGflCTTGflTCCflTTTCCTCCR M^O 

UULUHIfllTVUl IDLIHFLH 

TGCflflRCTRTCTflflflTGTflflTTfiRTTflTGfiTTTTGflTGRTRflTGTTflCCRTflCflTTRTRT 1 5 00 

FlHVLHUtHVGFDDHUT!HVt 

CGCTflCTTGGTTRGTGTRTTRTTCRGTRTGRRGRCCTRTTRRTRRTTRCTTRTCTTTTGR 1560 
RTULUVVSU* 

CGRTCTTGTTRTRflTTRTRRTRTRRRRflTRCTTfiTGGCRTRGTflRCTCRTRflTTGCTGRC 1620 
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Fig. 11. 



Gap Ueight. 3.000 fluerage Hatch. 0.5-^0 

Length Lieight. 0.100 fluerage Hismatch. -0.396 

Qual i ty . 113.0 Length. 199 

Rat i o . 0 . 608 Gaps , 4 

Percent Similarity. 56.354 Percent Identity-. 29.282 

The top sequence is SalG2R and the bottoa sequence is the yeast GK 



Start of G2R 

1 nSGIUI::SI ILS GPSGLGKT fllfllCRLnGI V. . . LOLU CP I PL DFL UL HERE ^7 



1 ... ■ SRPI Ul S GPSGTG^S TLLKKLFREVPDSFGFSUSSTTRTPRflGEUH 6 

4 8 GUDVHVUHRERlUfcCGIflflGNFLEHTEFLGH! VGTSKTRUHTRRIHHRICU 9 7 

4 7 GKDVHFUSUDEFiCSn 1 KNHEF lEUflOFSGHVVGSTUflSUKqUSKSGKTCl 96 

98 nOLH I DGURSLKH . TVLnPVSUV i RPTSLKnUETKLRCRHTERHDE I HRR M6 

9 7 LD lOnOGUKSU^CRIPELNRRFLF I flPPSUED LKJCRLEGRGTE TEE S \ HKR 116 

M7 U I LRKTOnDERHERGLFOT 1 ! I EDDUHLRVSKL 1 0 - I LQOR I RH VF H T H 194 

147 LSflRORELflVR.ETGRHDKUIUHDCLDKflVI^ELKDFlFflEK 186 
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Fig. 12. 



HlndB3R 

CTAAGAACACGTATACGGCAGCAGCTTCCTTTATACTCTCATCTTTTACCAACACAAAGG 507 
LRTRIRQQLPLYSHLLPTQR 

GTGGATATTTGTTCATTGGAGTTGATAATAATACACACAAAGTAATTGGATTCACGGTGG 567 
VDICSLELIIIHTK* 

GTCATGACTACCTCAGACTGGTAGAGAATGATATAGAAAAGCATATCAAAAGACTTCGTG 627 

TTGTGCATTTCTGTGAGAAGAAAGAGGACATCAAGTACACGTGTCGATTCATCAAGGTAT 687 

Site D 

ATAAACCTGGGGATGAGGCTACCTCGACA TACGTG TGCGCTATCAAAGTGGAAAGATGCT 747 

GTTGTGCTGTGTTTGCAGATTGGCCAGAATCATGGTATATGGATACTAATGGTATCAAGA 807 



AGTATTCTCCAGATGAATGGGTGTCACATATAAAATTTTAATTAATGTAATAGAGAACAA 867 



ATAATAAGGTTGTAATATCATATAGACAATAACTAACAATTAATTAGTAACTGTTATCTC 927 



I I I I I lAACTAACCAACTAACTATATACCTATTAATACATCGTAATTATAGTTCTTAACA 987 



TCTATTAATCATTAATTCGCTTCTTTAATTTrrr/irAAACTAACAITGTTAATTGAAAAG 1047 

HindB4R 

GGATAACATGTTACAGAATATAAATTATATATGGA I i I I I I I AAAAAGGAAATACTTGAC 1107 

MDFFKKEILD 

TGGAGTATATATTTATCTCTTCATTATATAGCACGCGTGTTTTCCAA I I I I ICCACATCC 1167 
WSIYLSLHYIARVFSNFSTS 
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Fig. % . 



Hindlll EcoRI 



SnaBI Hpal Hindlll Sail Ncol Smal SnaBI 



\1. 




Hindlll 
enos 



Synthetc otigo 
S7-S8 




puciia 



I viti i-aft^r^i.li i-ffitllHiiii ii;^-^ 



ampR 

Cut wiih Hindlll and EcoRI 



SnaBI Hpal Hindlll Sail Ncol Smal SnaBI 



pH6 promoter 



PGR product digested 
with Sail and Hindlll 




plMMB3 

ampR 



Cut with Hfndllt and Sail 



Cut with Ncol ana Sa 



p7.5 oromoter 



PGR product digested 
with Sail ana Ncol 




SnaBI Hpal Hinall! Sail Ncol SmalSnaB! 




SnaBi Hpal HindUl Sail NccI Smai SnaBI 



pH6 

piMMB4a 



amoR 



p7.5 

plMMB14b 



t " i- f -s r .w;M ,riii^v,i, i 
ampR 
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SnaBI 
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ampr 
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Hindm^^^NcoI 



be 



Smal 



SnaBI 



pIMMBKb 
ampr 



HPV16E67 



Hindm/Smal 
fragment from 

plMS7.2 



Digest with 
Hindlll/Hpal 



Digest with 
Ncol/Smal 



Hindm 



Sail 

SnaBI HPV16E67'|1/i5isnaBI SnaBI^?^ 



Sail 



HPV18 E67 

V ////////A 

Ncol /Smal 
fragment from 
pIMSe 2 



plMS12 
ampr 



/Smal 



Ncol 




Smal 



HPV18 E67 



SnaBI 




plMS13 
ampr 



Digest with 
Sail and Smal 



Digest with 
Sail and Smal 



Sail 



cr,,DT HindUJlSmalc dt 

SnaBI HPV16E67 i I SnaBI 



idnjl: 



plMS12 
ampr 



Sail 

Ncol Smal 
^ I HPV18 E67 



Sail 



r OT Hindlll 
SnaBI HPV16 E67 



Fiq. 15. 



Ncol Smal 
HPV18E67 -SnaBI 



plMS15 



amor 
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Fig.16. 



Sal! 



SnaBf Hpal 



Hindlll 




Ncol 



Smal SnaB! 



HPV16E5-7 HPV18E6-7 

pH6 p7.5 



pIMSiS 



JC:::-:.>>: :::;^:V;::::;:-'::i:>':;:::-> 



ampR 



PCT/GB92/00424 



Cut with SnaBI 
Isolate E6-7 
tragment 



Unique srte 



neutral site 

pIMMBS a-d 



ampR 



Cut at unique srte 



Hpal 



Sail 

Hindlll Ncol 



Smal 



neutral 
sne left 



■ a a - - 
i ■ ■ ■ ■ 



HPV16 E6-7 p7.5 



plMMB9 a-d 



neutral 
site ngh! 
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Fig. 17. 



pIMMB9 a-d 



I L- HPV16E67 HPV18E67 



homoiogous 
recomdmation 





Recombination 
piasftiKj 



-C5S2ZS3ESE2aa3- 

L R 



Viojs genome 
— - containing neutral 
site 



Recombination between 
plasmid and vinjs genome. 
Recombinant selected by probing 
with HPV-speific sequences. 



^ HPV16E67 HPV18E67 R 



Recombinant virus 
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Fig. 18. 

MB 0 3 TCAGGATCCCACATGAGCGAAAAATACATCG 

M5 0 7 TCAAAGCTTATTACGATACAAACTTAACGGA 

NS09 TCAGTTAACATPJiAAAG^CAACGCCCZGCAG 

MB 1 4 TCAAGGGCCTCTATATAGTAATACCAATACTC 

MBl 5 TCAGTCGACTTACAAACAACTAGGAAATTGG 

MBl 6 TCAGAATTCTATGTACAGAGGTCTATTAGGC 

MB 1 7 TCAAAGCTTGTATGAGGTGGGCAGCGTTCAC 

MB 18 TCAGAATTCTTAATTATATTGTCGGCCGTGG 

MBl 9 TCAGCATGCATGATCCGTTAGCTTTGGGCTC 

MB22 TCAGAATTCGAAGCTCTAGAGTATCTTAGCG 

MB 2 3 TCAAAGCTTTCCTGTATTATATGGGATGTGG 

MB 2 4 TCAGAATTCATTGATGGATGAGATATACAGC 

M3 2 5 TCAAAGCTTTCACAAAATCG 

MB 2 6 TCAGAATTCCACGTATACGGCAGCAGCTTCC 

MB 2 7 TCAAAGCTTTGTTCTACGTCCATTTTCAAGC 

M332 TCAGTCGACATACCAATACTCAAGACTACGA 

MB 3 3 TCACCATGGATTGCTATTGATTGAGTACTGTTC 

MB 3 5 AGTACCTTATAATACGTAATAATCTGGTAG 

MB 3 6 AATCTTCAGTCTGTTAACAATCAGTAGCAC 

MB 3"' TACAACGAAGCTAGGCCrCAACCATTTTAA 

MB 3 8 TTTGATAGCGCATACGTATGTCGAGGTAGC 

MB 3 9 AATAGGTATATAGTTAACTGGTTAGTTAAA 

501 ATCCCATGGCGCGCTTTGAGGATCCAAC 

502 TCACCCGGGTTACTGCTGGGATGCACACCAC 
30 5 ATCCCATGGACCAAAAGAGAACTGCAATGTTTC 
SO 6 TCACCCGGGTTATGGTTTCTGAGAACAGATG 

S07 AGCTCTACGTAGTTAACAAGCTTGTCGACCCATGGCCCGGGTACGrA 

SOS AATTTACGTACCCGGGCCATGGGTCGACAAGCTTGTTAACTACGTAG 

S20 GAAACCCAGCTGGGAATCATGCATGG 

S 2 1 GAAACACAAGTAGGAATATTAAGTATG 

S22 GATCTCTACGGTTATGGGCAATTAAATGAC 

S i_ J oA^^GTTCTAv-ToTCACGo'^CAATTAAGCGAC 

ATGTATAGATTTCTACAGTAGAATCAGAGAATTAAG 
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Fig. 19. 



rATrGTTTACGAGCTATTAAACrGTTTATTAATGATCACLA.TGCrTGA^ 
I C L R A I K L F : N D H M L D K I K S 
^VYE1LNC1LM:7CL:R*^;L 
1 F T S Y * T V Y - * S H A * * D K : Y 

ATACTGCAGAATAGACTAGTATATGTGGAAATGTCATAGAAAGTTAAAAGTTAATGAGAG 
I LQNRLVYVEMS 'KVKS * * E 
^CRID*YMW KG HRKLKVNES 
TAE'TSICGNVIES *KLMRA 



CAAAAATATATAAGGTTGTArrCCATATTTGrTATTTTTTGTGTAATAGTTAG;^^ 180 

QKYIRLYSIFVIFSVIVRKI 
KNI*GCIPYLLFF1* *LEKY 
KIYKVVFKICYFFCNS - KMT 

CATTCGATGGTCTATCTACCAGATTATTATGTGTTATAAGGTACrrTTTGTCATAATAAA 2 4 C 

HSMVYLPDYYVL*GTFS HNK 
IRWS I YQIIMCYKVL-F L I I N 
FDGLSTRLLCVIRYFFS * * T 

CrAGAGTATGAGTAAGATAGTGTriTTCAAAACATATAAATCTAAAATTGATGG^^ 300 
1EYE*DSVFCNI*I*N*WMR 
* SMSKIVFFKTYKSKIDG*D 
RV*yR*GFSKH:NLKLMDEI 

tatacagctattaatttcgaaaatatattrraatctgataactttaaacatggat^^ 3 60 

yta:nfenif*sdnfkhgfl 
^qi^i-iskiyfnlitlnmdf* 
-sy*frkyili**l*twifd 

ATGGTGGTTTAACGTTTTAAAAAAAGATTTTGTTATTGTAGTATATGATAATAT^ 4 2 0 

M V V * R F K K R F G Y C S r * * Y * K 

^ L K K D F '/ I V '/ Y D : K R 

G G L T F * K K I L L L * Y M I I L p; 0 

ATGGATATAAAGAATTTGCTGACTGCATGTACTAITTrrTACATTACTACAT^ 46 0 

MDIKNLLTACTIFYITTLAT 
- * ?^ I C * L H V 1 F F T 1 1 H L *R 

G Y K E F A G C M Y F 1 H Y Y : G Y G 

GCAGATATACGTACTCCGCCACCAACGGGTGATGTGACAAGGGAGAATATCrr GATAJ^GA 54 0 

^ ? T P ? ? T G H V T R E N : 1 I R 
^' I Y L L R H Q R '/ M - Q G R : £ * ' E 

R Y T Y S A T N G S C D K G E Y L G K R 

GGGATAATGAATGTTGTAATCGGrGTGGAGGTGGAGAATTTGGGAAGGTTAGATGrA^TG 60 0 

^ ' ' N* V I G \^ H L E N 1 ? R L D M 

^ * S M L " S V S G W R I G Q G * ' W 

^' C N R G ? ? G E F A K R G V G 

GTAACGATAACAGAAAArGTGA^CGGTGCGGAGGTGATACATATAGGAGAATGGGA.a.TTA 660 
G : 7 G V A A H G I H I P G S : L 

* ' n :< M • G 1 P T S Y I Y H ^J F \' Y 

- N G K G E R G ? P - G : T T 1 ^ 1 I 
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Fig.19(Cont). 



TTCTAATGGATGTCATCAATGTAGAAAATGCCCAACCGGATCATTTGATAAGGTAAAGTG 7 20 

F*WMSSM*KMPNRII * 'GKV 
S G C H Q C R K C ? T G S F D K V K C 

LMDVINVENAQPDHLIR'SV 

TACCGGAACACAGAACAGTAAATGTTCGTGTCrrCCTGGTTGGTATTGTGCTACTGATTC 7 8 0 

YRNTEQ*MrVSSWLVLCy'F 

TGTQNSKCSCLPGWYCATDS 
PEHRTVNVRVFLVGIVLLIL 

TTCACAGACTGAAGATTGTTGAAATTGTGTACCAAAAAGGAGATGTCCAT^ 84 0 

FTD*RLLKLCTKKEMSMRIL 
SQTEDC*NCVPKRRCPCGYF 
HRLKIVEIVYQKGDVHAOTL 

TGGTGGAATAGATGAACAAGGAAATGCTATTTGTAAATCGTGTTGTGTTGGTGAATATTG 900 
WWNR*TRK5YL* IVLCW'IL 
GGIDEQGNPICKSCCVGEYC 

VE *mnkeilfvnrvvlvn:a 

CGACTACCTACGTAATTATAGACTTGATCCATTTCCTCCATGCAAACTATCTAAATGT^ 960 
RLPT*L*T*SISSMQTI*M* 

DYLRNYRLDPFPPCKLSKCN 
TTYVI IDLIHFLHANYLNVI 

TTAATTATGATTTTGATGATAATGTTACCATACATTATATCGCTACTTGGTTAGTGTAT^ 1020 
LIMILMIMLPYI ISLLG*CI 
*L*F* **CYKTLYRYLVSVL 
NYDFDDNVTIHYIATWLVYY 

ATTCAGTATGAAGACCTATTAATAATTACrrATCTTTTGACGATCTTGTTATAATTATA^ 1080 
IQYEDLLIITYLLTILL*L* 
FSMKTY**1LIF*RSCYNYN 
SV*RPINNYLSFDDLVIIII 

tataaaaatacttatggcatagtaactcataattgctgacgcgataaattcgtaata^ 114 0 

ykntygivthnc*rdkfvii 
:kilma**liiadains**s 

* K Y 1 W H S N S * 1 L T R * I R N N 1 
TGrTTTGTTCAAArTTTTATAAGGAATCTACAGGCATAAAAATAAAAATATP^ 1200 

c F V Q : r I R ^; L c A * K * K V N 1 ' 

'v' L F :-: F L * G I : R H K N K N I I V >; 
F C S ^^ F Y K E S T G I K I K I * r I : 

TATACTCTTACAGCGCGCCATCATGAATAA'GAGCAGTGAATTGATTGCTG 12 60 

Y T L T A R H H E * 2 C * : D C 



sysa?S":tavn*ll 
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Fig. 20. 



atatttggtattaccgcattaattatattgtcggccgtggcaattttctgtattacatat 6 0 

ifgitaliilsavaifcity 
ylvlph^lycrpwqfsvlh: 
iwyyrinyivgrgnflyyil 

tatatatataataaacgttcacgtaaatacaaaacagagaacaaagtctagari-x":":'gac 120 
yiynkrsrkyktenkv* ifd 
lyiinvhvntkqrtksrflt 
yi**tft*iqnreqsldf*l 

ttacataaatgtctgggatagtaaaatctatcatattgagcggaccatctggtttaggaa 13 0 

lhkclg**nlsy*adklv* e 

yinvwdskiyhiertiwfrk 
t*msgivksiilsgpsglgk 

agacagccatagccaaaagactatgggaatatarrrggatttgtggtgtcccataccact 2 4 0 

rqp*pkdygnifgfvvshtt 
dshsqktmgiy1,dlwcpipl 
taiakrlweyiwicgvpyh* 

agatttcctcgtcctatggaacgagaaggtgttgartaccattacgttaacagagaggcc 300 
rfprpmeregvdyhyvnrea 
dflvlwnekvlititlter? 
isssygtrrc*lplr*qrgh 

atctggaagggaatagccgccggaaactttctagaacatactgagtttrraggaaatat^ 3 60 

iwkgiaagnflehteflgni 
sgre*?petf*nilsf*eif 
legnsrrklsrty*vfrkyl 

tacggaacrrctaaaacagctgtgaatacagcggctattaataatcgtatttgtgtgatg 4 2 0 

ygtsktavntaainnri cvm 

r E L L K Q L * : Q R 1 1 I I V F V * W 
RNF*NSCEYSGY*'SY1CDG 

'GATCTAAACATCGACGGTGTTAGAAGTCTTAAAAATACGTACCTAATGCCTTACTCGGTG 4 8 0 

DLNIDGVRSLKNTYLMPY3V 
I^TSrVLEVLKIRT^CLTRC 
SKHRRC* KS*KYVPNALLGV 

rATATAAGACCTACCTCTCTTAAAATGGTTGAGACCAAGCTTCGTTGTAGAAACACTGAA 5 4 0 

Y : R ? T S L K M V £ T K L R C R N T Z 
: * D L P L L ?: W 1 R ? S ? V E r 1 K 

: K T Y L S - N G * 0 Q A S L - K H - S 
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Fig.20(Cont). 

GCTAACGATGAGATTCATCGTCGCGTGATATTGGCAAAAACGGATATGGATGAGGCCAAC 6C0 

A D E I H R R V' : L A K T Z- M D E A .\' 

1 T M R F I V A * Y W Q K R I a M R P 7 
* * D S S S K Z 1 Z Y. y< G G * G Q P. 

GAAGCAGGTCTATTCGACACTATTATCATTGAAGATGATGTGAATTTAGCATATAGTAAG 660 

H: A G L F D T I I I E D D ^ L A Y S K 

KQVYSTLLSLKMM' I *HIVS 
SRSIRHYYH*P*C^^*^^ 

TTAATTCAGATACTACAGGACCGTATTAGAATGTATTTTAACACTAATTAGAGACTTAAG 7 2 C 

LIQILQDRIRMYFNTN*RLK 
* FRYYRTVLECILTLIRDLR 
S D T T G F Y * N V F ' H * L E T • D 



ACTTAAAACTTGATAATTAATAATATAACTCGTTTTTATATGTGGCTATTTCA^ 7 80 

T*NLriNNITRFYMWLFQRL 
LKT* *LII*LVFICGYFNV* 
LKLDN**YNSFLYVAISTSN 

ATGTATTAGTTAAATATTAAAACTTACCACGTAAAACTTAAAATTTAAAATC^ 8 4 0 

MY*LNIKTYHVKLKI*NDIS 
CIS*ILKLTT*NLKFKMIFH 
VLVKY*NLPRKT*NLK*YFI 

TTGACAGATAGATCACACATTATGAACriTTCAAGGAC^ 900 
LTDRSHIMNFQGLVLTDNCK 
* QIDHTL*TFKDLC*LTIAK 
DR*I7HYELSRTCVN*QLQK 

AATCAATGGGTCGTTGGACCATTAATAGGAAAAGGTGGATTCGGTAGTATTTATACTACT 960 

NQWVVGPLIGKGGFGSIYTT 
I N G S L D H * * E K V D S V V F I L L 
SMGRWTINRKRWIR* YLYY * 



AATGACAATAATTATGTAGTAAAAATAGAGCCCAAAGCTA 

N D N N Y V V K I E ? ?: A 

* Q * 1 C S K N R A Q S S 



SUBSTITUTE SHEET 



VV O 92/16636 




PCT/GB92/00424 



Fig. 21. 

ACCATCGAGGTAACCACCrCTCTGGu^AGACAGCGTGAATAATGTACTCATGAAACGrrT 6 0 

T I E V T T S L £ 0 S V N N V L M K R L 
?SR*FPLWKrA*IMYS*NVW 
HRGNHLSGRQRE 'CTHETRG 

GAAACTATACGCCATATGTGGTCTGTCGTATATGATCATTTTGATATTGTGAATGGTAAA 12 0 

ETIRHMWSVVYDHFDIVNGK 

klyaicglsymiilil*mvk 
nyt?yvvcri*sf*ycew»r 

gaatgctgttatgtgcatacgcatttgtctaatcaaaatcttataccgagtactgtaaaa 1 8 q 

h:ccyvhthlsnqnlipstvk 
navmciriclikilyrvl*k 
mllcayafv*sksyteyckn 

acaaatttgtacatgaagactatgggatcatgcattcaaatggattccatggaagc^ 2 4 c 

TNLYMKTMGSCIQMDSMEAL 
QI CT*RLWDHAFKWIPWKL* 
KFVHEDYGIMHSNGFHGSSR 

GAGTATCTTAGCGAACTGAAGGAATCAGGTGGATGGAGTCCCAGACCAGAAATGCAGGAA 300 
EYLSELKESGGWSPRPEMQE 
SrLAN*RNQVDGV?DQKCRN 
*RTEGIRWMESQTRNAGI 

TTTGAATATCCAGATGGAGTGGAAGACACTGAATCAATTGAGAGATTGGTAGAGGAGTTC 3 60 

FEYPDGVEDTESIERLVEEF 
LNIQMEWKTLNQLRDW*RSS 
* ISRWSGRH*IN*EIGRGVL 

TTCAATAGATCAGAACrrCAGGCTGGTGAATCAGTCAAATTTGGTAAlTCTATTAATGTT 4 20 

FNRSELQAGESVKFGNS IN'V 
S ^ i:^ Q N F R L V r; Q 3 N L V I L L M L 
IRTSGW*ISQIW*FY*C* 

AAACATACATCTGTTTCAGCTAAGCAACTAAGAACACGTATACGGCAGCAGCTTCCTTTA 4 80 

KHTSVSAKQLRTRIRQQLFL 
NIHLFQLSN'*EHVYGSSFL^' 
-"ICFS^ATKNTYTAAASFI 

TACrCTCATCTTTTACCAACACAAAGGGTGGATATTTGTTCATTGGAGrrGATAAT^^ 54 0 

YSHLLPTQRVDICSLELIi: 
"LIFYQHKGWIFVHWS* 
L S S F T N T K G G Y L F : G V D N N 

CACACAAAGTAATTGGATTCACGGTGGGTCATGAGTACCTCAGACTGGTAGAGAATGATA 6 00 

H T K * L D S R W V M T T S D W * R M : 
T 0 S N W I H G G S * L ? Q T G R 'e *Y 
H K V 1 G T V G H 0 Y 1 R L V E C I 

TAGAAAAGGATATCAAA-^GACTTGGTOTTGTGCATTTCTGTGAGAAGAAAGAGGACATCA o60 

^< S : S K D F 'v' L C I S V R R K R T S 
R K A : Q K T S C C A F L ' E E * R ^ 3 H ^ : 
E K H I K R L R '/ '/ H F C E K K E C I * K 
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Fig.21(Cont 1). 

AGTACACGTGTCGATTCATCAAGGTATATAAACCrGGGGATGAGGCTACCTCGACATACG 720 
STRVDSSRYINLGMRLPRHT 
VHVSIHQGI*TWG*GYLDIR 
YTCRFIKVYKPGDEATSTYV 

TGTGCGCTATCAAAGTGGAAAGATGCTGTTGTGCTGTGTTTGCAGATTGGCCAGAAT^ 7 80 

CALSKWKDAVVLCLQIGQNH 
VRYQSGKMLLCCVCRLARIM 
CAIKVERCCCAVFADWPESW 

GGTATATGGATACTAATGGTATCAAGAAGTATTCTCCAGATGAATGGGTGTCAC^ 84 0 

GIWILMVSRSILQMNGCHI * 
VYGY*WYQEVFSR*MGVTyK 
YMDTNGIKKYSPDEWVSHIK 

AATTTTAATTAATGTAATAGAGAACAAATAATAAGGTTGTAATATCATATA 900 
NFN*CNREQIIRL*YHIDNN 
ILINVIENK**GCNII *TIT 
F *LM* *RTNNKVVISYR<2* L 

TAACAATTAATTAGTAACTGTTATCTCTTTTTTAACTAACCAA^ 960 
*QLISNCYLFFN*PTNYIPI 
NN*LVTVISFLTNQLTIYLL 
TIN**LLSLF*LTN*LYTY* 

AATACATCGTAATTATAGTTCTTAACATCTATTAATCATTAATTCGCTTCTCT 1020 
NTS*L*FLTSINH*FASLIF 
IHRNYSS*HLLIINSLL*FF 
YIVIIVLNIY*SLIRFFNFL 

TATAAACTAACATTGTTAATTGAAAAGGGATAACATGTTACAGAATATAAATTATATATG 1080 
YKLTLLIEKG*HVTEYKLYM 
IN* HC*LKRDNMLQNINYIW 
* TNIVN*KG:rCYRI»IIYG 

GArTTTTTTAAAAAGGAAATACTTGACTGGAGTATATATTTATCTCTTCATTATAT^ 1X40 
OFFKKEILDWSIYLSLHYIA 
* F 1 K R K Y L T G v' Y I Y L F I I * H 
f'F^KGNT^LZYIFISSLYS**? 
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Fig.21(Cont2) 



CGCGTGTTTTCCAATTTTTCCACATCCCATATAATACAGGATTATAATCTCGTTCGAACA 1200 

RVFSNFSTSHIIQDYNLVRT 

ACr?IFPH?I'YRri:SFEw 
RVFQFFHIpvfj-r.jjj^.gj^^'-j^. 

TACGAGAAAGTGGATAAAACAATAGTTGATTTTTTATCTAGGTTGCCAAATTTATTCCAT 1260 
YEKVDKTIVDFLSRLPNLFH 
TRKWIKQ-LIFYLGCQIYsi 
RESG-NNS^FFI-VAKFIpv 



ATTTTAGAATATGGGGAAAATATTCTACATATTTATTCTATGGATGATGCTAATACGAAT 

ILEYGENILHIYSMDDANTN 
F'NMGKIFYIFILWMMLIRI 



FRIWGKYSTYLFYG*C 



Y E Y 



1320 



ATTATAATTTTTTTTCTAGATAGAGTATTAXATATTAATAAGAACGGGTCATTTATACAC 1380 

^-IfS"LDRVLNINKNGSFIH 
L'FFF'IEY'ILIRTGHLv- 

YNFFSR'SIKY'-ERVIYTQ 



1440 



AATCTCAGGTTATCATCATCCAITAATM3UUgMaUVTATGTAT^^ 

NLRLSSSINIKEYVYQLVNN 
ISGYHHPLI*KNMYIN«LIM 
SQVIIIH«YKRICISIS*»» 

GATCATCCAGATAATAGGATAAGACTAATGCTTGAAAATGGACGTAGAACAAGACATTrr 1500 
DHPDNRIRLMLENGRRTRHF 
^^QIIG'D'CLKMDVEQDII 
SSR**DKTNA»KWT»NKTFF 
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Fig. 22. 



(a) 



Vaccinia virus 
H6 promoter 



Vaccinia virus 
7.5K promoter 



Vaccinia virus DNA 



HPV16 E6-7 



\.7 



HPV18 E6-7 



Vaccinia vims DNA 



MB17 



1413 bp 



MB15 



MB32 



1569bp 



MB16 



The sequences present at the 5' end of PGR primers Ml 5 and MB32 
overlap by 6 nucleotides 



(b) 




^2kb 
- i.Skb 



Size marKers 



MB32/MB16 MBl 7/MB i 5 

PGR Drimers 



SUBSTITUTE SHEET 




PCT/GB92/00424 



Fig. 23. 
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(a) 



Fig. 2 A 
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Fig. 25. 
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Fig. 26. 
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Fig. 26 (Conf) 
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